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Abstract 

Many investigations of scientific collaboration are based on statisti- 
cal analyses of large networks constructed from bibliographic repositories. 
These investigations often rely on a wealth of bibliographic data, but very 
little or no other information about the individuals in the network, and 
thus, fail to illustrate the broader social and academic landscape in which 
collaboration takes place. In this article, we perform an in-depth longi- 
tudinal analysis of a relatively small network of scientific collaboration 
(N = 291) constructed from the bibliographic record of a research center 
involved in the development and application of sensor network and wire- 
less technologies. We perform a preliminary analysis of selected structural 
properties of the network, computing its range, configuration and topol- 
ogy. We then support our preliminary statistical analysis with an in-depth 
temporal investigation of the assortative mixing of selected node charac- 
teristics, unveiling the researchers' propensity to collaborate preferentially 
with others with a similar academic profile. Our qualitative analysis of 
mixing patterns offers clues as to the nature of the scientific community 
being modeled in relation to its organizational, disciplinary, institutional, 
and international arrangements of collaboration. 
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1 Introduction 



Scientific communities have large, well-established, and relatively well struc- 
tured digital footprints which have increasingly been the focus of specialized 
research. These footprints, composed of scholarly publications and related ar- 
tifacts, have been employed for bibliometric analyses involving coauthorship, 
citation, co-citation, acknowledgments, and other such indicators of scientific 
productivity and knowledge production [TJ. Furthermore, as the majority of 
scientific publications are now available and consulted online, a number of re- 
cent analyses have progressively studied not only the production of scholarly 
artifacts but also their usage. Recent studies include: an analysis of the struc- 
ture of readership networks in the field of education research [5] and a large-scale 
analysis of user request logs (clickstr earns) gathered from scholarly web portals 
4 . In general, these kinds of research are made possible because the scientific 
community is a large social institution that is open to investigation. 

Coauthorship patterns are perhaps the most studied scholarly and scientific phe- 
nomena. Recent studies of coauthorship have analyzed the literature production 
within specific domains such as high energy physics |24j . genetic programming 
[3"rJ] . neuroscience [S], nanoscience [33], library science [3T], economics [T7], or- 
ganizational science [I], and psychology and philosophy [13]. Similar analyses 
have also been comparative in nature and have explored social and normative 
differences of coauthorship behavior across different domains |31j . Moreover, 
an increasing number of studies of this kind have accounted for the evolving 
component of scientific collaboration [2l [TU1 [3"51 |3"4"] . 

More specifically, coauthorship patterns have been widely and actively studied 
from a social network analysis perspective for over two decades [20j [19| [14] . Most 
social network research involved with coauthorship is based upon this underly- 
ing concept: two individuals (nodes) are regarded as coauthors if they appear 
together in the author list of a publication (edge). This relational structure 
works reasonably well when investigating coauthorship patterns in scholarly ar- 
rangements where publications are authored by relatively small groups. It is 
true that some scientific domains have experienced an increase in the number 
of authors per publication making it impossible to discern the nature and ex- 
tent of individual contributions to a publication |12] . A striking example of this 
phenomenon can be found in the domain of high-energy physics where author 
lists for a single publication often include tens or even hundreds of authoring 
researchers [37] . For this reason, a number of recent studies of physics collabora- 
tion supplement traditional analytic techniques with more qualitative methods 
of survey research, i.e., directly asking authors to indicate the real nature of 
their contributions to a publication [35, 3 . However, besides the singular case 
of high-energy physics, the vast majority of scholarly coauthorship networks 
incorporate collaboration circles of only a handful of authors per publication, 
suggesting that coauthorship activity can be adequately employed to construct 
a valid social network of collaboration . 

In this article, we perform a temporal analysis of a coauthorship network con- 
structed from the bibliographic record of a research center involved in the de- 
velopment and application of sensor network and wireless technologies. The 
network studied here is relatively small in size compared to networks generally 
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analyzed in related research. The small size of the collaboration network results 
in a fundamental advantage: besides analyzing certain large-scale structural 
properties of the network, we can explore the social and academic arrangements 
in which collaboration patterns evolve, based on a set of node characteristics. 

The study of node characteristics can provide insights into the level of "ho- 
mophily" in a social network, i.e. the tendency of individuals to create ties 
with similar others |23l for a review]. The homophily principle describes how 
homogeneous a network is in terms of specific sociodemographic, behavioral, 
or interpersonal characteristics. For example, a high level of homophily in a 
friendship network indicates that individuals with certain characteristics — such 
as race, ethnicity, political beliefs, and educational background — tend to make 
friends with individuals with similar characteristics. Many studies of homophily 
are grounded in sociology and investigate patterns of homophily as well as their 
driving forces and their implications. An established method to measure mathe- 
matically the level of homophily in a network is by computing its "assortativity" 
(also "assortative mixing"), i.e., the extent of mixing between similar nodes in 
a network [29 . While many different components of similarity can be investi- 
gated, the vast majority of large-scale studies of networks look at the mixing 
of node degree, i.e. how nodes with similar degree preferentially attach to one 
another. Mixing patterns, however, can also be calculated based on discrete 
node-specific characteristics. In studies of scholarly and scientific collaboration, 
examples of characteristics that have been investigated include research inter- 
ests [TS], academic domain [25], geographical location [32], age group [5J, and 
country of origin |32j of the individuals in the network. Studying networks in 
terms of these node properties can offer insights into the mechanisms by which 
disciplinary, institutional, and spatial arrangements shape, and are shaped by, 
collaboration patterns. 

The present study, exploratory in nature, ties a quantitative analysis of a net- 
work's assortativity structure to a qualitative explanation of the social and 
academic landscape in which such network is embedded. Scholars working with 
scientific collaboration networks arc increasingly becoming interested in ground- 
ing their quantitative analyses in sociological theory. An example is the work 
of Kretschmer & Kretschmer [TB] that investigates whether the distribution of 
co-author pairs' frequencies in a collaboration network can be regarded as a "so- 
cial Gestalt" . They derive a mathematical function to describe social Gestalts 
in networks and apply it on four large scale coauthorship networks (cumulative 
Nij > 2M co-author pairs) to explore the relationship between coauthorship 
and node-based author productivity. 

This study focuses on a much smaller collaboration network (N — 291). The 
small size of the network allows us to both perform a quantitative analysis of 
selected structural properties of the network and provide a a sociological expla- 
nation of our findings, based on an in-depth qualitative analysis of assortative 
mixing patterns. We demonstrate how certain social and academic dynamics, 
for example the emergence of new international collaborations or the inception 
of new inter-departmental efforts, have varying levels of control in the resulting 
topology and configuration of the scientific collaboration network. 
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2 Present study 



This article presents the findings of a study of scientific collaboration at the 
Center for Embedded Networked Sensing (CENS )[] CENS is a National Science 
Foundation (NSF) Science & Technology Center funded in 2002. CENS sup- 
ports interdisciplinary collaborations among faculty, students, and staff of five 
partner universities in Southern California: University of California, Los Ange- 
les (UCLA); University of Southern California (USC); University of California, 
Riverside; California Institute of Technology (Caltech); and University of Cal- 
ifornia, Merced. From 2005, CENS features a headquarter office base located 
within Boelter Hall at UCLA. 

The mission of CENS research is to use sensor network technology to reveal 
previously unobservable phenomena. From its inception, CENS has developed 
and deployed sensor network devices for the study of a wide range of natu- 
ral phenomena, such as seismic activity, fluid contaminant transport, and bird 
breeding behavior. Besides these pursuits in the natural sciences, the social 
and built environments have progressively become the focus of applied CENS 
research: sensing mobile systems are being employed for the study of public 
health, environmental protection, urban planning, and cultural expression. 

The type of research conducted at CENS now spans a wide spectrum of dis- 
ciplines and applications (from biology to seismology, from wireless telecom- 
munications to statistics, from education to environmental science) requiring 
continuous cooperation among individuals that, otherwise, would probably not 
interact beyond the walls of traditional university departments and faculties. 
In such a diverse scholarly and scientific landscape, distributed collaboration 
on multi-disciplinary subjects constitute a fundamental leverage for scientific 
research. 

2.1 Data collection 

Computing the range, population and configuration of an interdisciplinary, 
multi-institutional research center like CENS can be an arduous task. 

"In order to be recognisable as such, a system must be bounded in 
some way. However, as soon as one tries to be specific about the 
boundaries of a system, a number of difficulties become apparent" 
[III p. 139]. 

In the case presented in this article, these difficulties have to do with the in- 
herently open and dynamic nature of modern science research centers. Unlike 
other types of organizational arrangements for which the boundaries are more or 
less evident (e.g. academic institutions and departments, corporate and govern- 
ment centers), many modern research centers and laboratories act as umbrella 
organizations with very flexible and blurry boundaries. CENS, for example, 
includes researchers from multiple institutions and disciplines. CENS scholars 

The website of the Center for Embedded Networked Sensing (CENS) is available online 
at http : //research . cens . ucla . edu/ 
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seamlessly interact with each other within and beyond institutional and depart- 
mental boundaries: collaboration patterns are ubiquitous and non-uniformly 
distributed. Researchers affiliated with CENS may also be affiliated with other 
research laboratories and perform interdisciplinary work on other projects and 
under different affiliations. Moreover, many CENS collaborations include re- 
searchers that are not officially affiliated with the center. In other words, the 
nature and extent of contribution to CENS collaboration depends on a number 
of organizational and scholarly factors, and is not solely restricted to individuals 
officially affiliated with the center. Under these conditions, what is the best way 
to construct a network that accurately captures scientific collaboration of this 
research center? 

Previous environment-specific studies of collaboration have delineated the pop- 
ulation under study by relying on publication data contained in an institutional 
repository p] or domain-specific bibliographic databases [21] to mine patterns 
of coauthorship that take place within a given institution or academic domain, 
respectively. For the purpose of this article, we used a similar mechanism, thus 
including in our population not only CENS members, but also individuals that 
though not officially affiliated, have contributed to the production of CENS or 
CENS-related scholarly publications. 

We constructed a database of publications by assembling the items included 
yearly in the NSF Annual Reports, which contain the official list of documents 
published by CENS for a given fiscal yearj^j For the available reporting pe- 
riod (2002 through 2008) and for every publication, we collected the following 
information: a) author names, b) publication title, c) publication type, d) pub- 
lication venue, and e) publication date. For the purpose of this article, we 
utilized author names and publication dates (years) to construct a coauthorship 
network, i.e. a network consisting of individuals (nodes) that are connected to 
each other (via edges) if they are recorded as authors on the same scholarly 
publication. The resulting bibliographic dataset consisted of 547 manuscripts 
(370 conference proceedings, 159 journal articles, 17 book chapters and 1 book), 
published over a period of 7 years (2001-2007). 

This bibliographic database was used to generate a weighted undirected net- 
work in which nodes represent authors and edges represent coauthorship activ- 
ity among them. For example, if the present paper had to be included in this 
network, its authors (Pepe and Rodriguez) would become two distinct nodes, 
connected by an edge. In order to determine the weights between nodes, i.e. 
the strength of collaboration among coauthors, we used a weighting mechanism 
proposed by Newman [27] by which the weight of the edge between nodes i and 
j is: 

k 

where Sf is 1 if author i collaborated on paper k (and zero otherwise) and 
rifc is the number of coauthors of paper k. For the example above, the edge 
between authors Pepe and Rodriguez would have Wij = 1. An article written 
by three authors (e.g., Pepe, Rodriguez, and Bollen) would result in three edges 

2 CENS Annual Reports are available online at 
http : //research. cens.ucla.edu/about/annual_reports/ 
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(Pepe-Rodriguez, Pepe-Bollcn, and Rodrigucz-Bollcn) , each one with Wij = 0.5. 
And so on. As such, this weighting mechanism confers more weight to small and 
frequent collaborations, based on the assumptions that: i) publications authored 
by a small number of individuals involve stronger interpersonal collaboration 
than multi- authored publications, and ii) authors that have authored multiple 
papers together know each other better on average and thus collaborate more 
strongly than occasional coauthors [27 . 

The resulting network data were "sliced" according to publication year yielding 
to 7 separate networks, each one representing the cumulative collaborative effort 
of CENS researchers up to that year. These networks of coauthorship for years 
from 2002 to 2007 are depicted in Figure [l] (a through f)(^] In all depicted 
networks, the diameter of each node is defined by its eigenvector centrality [6 
and the darker gray nodes denote individuals that are primarily affiliated with 
CENS. 

2.2 Preliminary analysis of network evolution: range, con- 
figuration, and topology 



For each one of the coauthorship networks under study, for years 2001 through 
2007, we calculated some fundamental network statistics, presented in Table [T] 



quantity/year 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


Authors (nodes) 


35 


68 


127 


203 


228 


278 


291 


Publications 


23 


69 


175 


303 


418 


496 


547 


Collaborations (edges) 


182 


346 


690 


1248 


1598 


2158 


2536 


Connected components 


4 


5 


8 


13 


9 


6 


5 


Diameter 


3 


4 


6 


7 


8 


8 


7 


Average path length, £ 


1.543 


2.324 


3.090 


3.038 


3.339 


3.224 


2.944 


Clustering coefficient, C 


0.645 


0.543 


0.432 


0.387 


0.389 


0.337 


0.329 


Degree assortativity, r 


0.432 


0.299 


0.272 


0.187 


0.180 


0.166 


0.165 



Table 1: A summary of the fundamental network statistics of the CENS coau- 
thorship network for the time period 2001-2007: number of authors, publications 
and collaborations (range), number of connected components and diameter of 
the largest connected component (configuration), and average path length, clus- 
tering coefficient and degree assortativity (topology). All values presented are 
cumulative. 

An analysis of these statistics provides insights into the evolution of the CENS 
coauthorship network over time. The first three rows of the table contain, for 
every year in the period under study, the cumulative number of a) authors, 
i.e. nodes in the network, b) publications (journal articles, conference papers, 
etc.), and c) collaborations (coauthoring events), i.e. edges in the network. 
When analyzed over time, these three values all follow a similar trend, which 
highlights two distinct time periods: a first term (2001-2004) during which 
all values increase sharply (roughly doubling in size from year to year), and a 
second term (2004-2007), during which the growth slows down. In particular the 
author count values indicate that CENS quickly became large and diversified in 

3 The network depicting year 2001, though analyzed in this article, is not diagrammed as 
it is very sparse. Its visualization does not inform the present discussion. 



G 




(e) 2006 (f) 2007 



Figure 1 : The CENS coauthorship network under study sliced according to year 
of publication (2002 through 2008, cumulative data). Node diameter represents 
the eigenvector centrality score of the node, where more central nodes have 
larger diameters. Moreover, the darker gray nodes denote individuals that are 
primarily affiliated with CENS. 
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its population in the first term reaching a solid population base of collaborators 
by the year 2004. In the second term, from 2004 to 2007, the author population 
increased again, but to a much lesser extent (from 203 to 291 individuals), while 
the number of published works and collaborations maintained a regular growth 
(from 1248 to 2536 collaborations), suggesting the formation of a core CENS 
authoring base. 

This finding is confirmed by a quick analysis of the network's configuration. The 
number of connected components, i.e. the number of maximal connected sub- 
graphs, goes from 4, in 2001, to 13, in 2004, indicating that the network becomes 
more fragmented in the first term, even if collaboration is overall increasing. In 
the second term, however, the number of connected components drops and the 
network quickly solidifies into a giant component, which indicates a solid base 
of strong collaboration. By looking at the the network diameter, i.e. the length 
of the longest geodesic path in the largest connected component, the formation 
of the giant component in year 2004 becomes evident. This is further reinforced 
by a quick analysis of Table [2j which lists component populations by year. 



year # population 



2001 


4 


18 6 7 4 


2002 


5 


38 16 8 4 2 


2003 


8 


93 10 7 5 4 3 3 2 


2004 


13 


150 10 88543333222 


2005 


9 


205 53333222 


2006 


6 


262 5 4 3 2 2 


2007 


5 


280 4 3 2 2 



Table 2: Node populations of connected components in the CENS coauthorship 
network, by year. 

The preliminary analysis of these first two sets of values from Table [T] gives us 
a good understanding of the evolution of the range and configuration of the 
network over time. A third set of values, presented in Table [I] (average path 
length, clustering coefficient, and degree assortativity) can be investigated to 
provide an in depth understanding of its topology. 

The average path length is the average length of the shortest paths between all 
possible node pairs and, in turn, an indicator of the efficiency of information 
transfer in a social network [39 . Short average path length, and thus high 
information transmission, arc typical characteristics of many real and small- 
world networks [40 . In the network under study, the average path length is 
about 1.5 in 2001; it grows steadily in the first term, reaching a value of about 
3.0, which stays roughly constant throughout the second term. This indicates 
that once the CENS authoring base is formed, an average of 3 steps are necessary 
to transfer information among any two pairs of nodes. Remarkably, this value 
resembles more closely that found in typical small- world networks, such as movie 
actors (£ = 3.65) [40], than that of scholarly coauthorship networks, such as 
mathematics (£ = 9.5) and neuroscience (£ — 6) [2]; yet, this relatively low 
average path is due to the relatively small size of the network analyzed (the 
mathematics and neuroscience coauthorship networks have N — 70, 975 and 
209,293, respectively). 

The clustering coefficient measures the density of clique-like triangles in a net- 
work. High clustering coefficient coupled with short average path length indi- 
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cates that a network exhibits small- world properties [3D]. In the CENS coau- 
thorship network, the clustering coefficient decreases steadily over time, from 
an initial value of 0.645 in 2001, to 0.329 in 2007. This suggests that the net- 
work becomes less cliquish and collaboration patterns becomes more uniform 
across the network over time. This trend reveals that the CENS network ini- 
tially matches the typical topology of highly-clustered disciplines such as physics 
(C = 0.56) and biology (C = 0.6) [SUJ but later drops to the values normally 
recorded in less cliquish domains, such as mathematics (C = 0.34) [T5] . 

A final indicator of network topology presented here is degree assortativity. In 
general, assortativity can be defined as the tendency for individuals (nodes) 
in a social network to establish connections preferentially to other individuals 
with similar characteristics |28j . The most common measure of assortativity is 
computed based on the individuals' degree centrality. In the network presented 
here, degree-based assortativity decreases steadily over time from a value of 
0.432 in 2001 to 0.165 in 2007. Interestingly, the decline of the degree assor- 
tativity measure follows very closely that of the clustering coefficient — the 
correlation between the two is p = 0.964 (p- value < 0.005). This means that 
as collaboration patterns in the network become more sparse and uniform (de- 
creasing C), they also become more mixed (decreasing r), i.e. highly-connected 
individuals begin to collaborate with lowly-connected ones. In the next sec- 
tion, we extend the study of assortativity to a set of discrete characteristics, 
namely authors' academic department, affiliation, position, and country of ori- 
gin. Analysis of these mixing patterns allows us to understand the homophilious 
and hctcrophilious components that contribute to the observed growth of the 
network. 



3 Studying network evolution in terms of dis- 
crete node characteristics 

The preliminary analysis of the CENS coauthorship network presented in the 
previous section reveals the following scenario. In 2001, the network of collab- 
oration is small and very fragmented. During the first few years of activity, 
however, the CENS group grows significantly in the number of authors and col- 
laborations. By the end of 2004, a solid base of collaborating authors (i.e. a giant 
component) is formed. In the analyzed network, small-world effects become less 
prominent over time; in particular, average distance between individuals be- 
comes larger (increasing £), and collaboration patterns in the network become 
more sparse (decreasing C) and more mixed (decreasing r). 

Although our preliminary analysis presents a fairly comprehensive account of 
the range, configuration and topology of the studied network of scientific collab- 
oration over time, we believe that it fails to provide a sociological explanation of 
the dynamics underlying the observed patterns. In particular, we were curious 
to explore further the correlation between clustering coefficient and assortativ- 
ity. Our preliminary analysis indicates that there exists a solid link between 
these two patterns: a) the network becoming more sparse and uniform, and 
b) collaboration patterns becoming more mixed. However, this analysis is re- 
stricted to degree assortativity and thus ignores other mixing patterns that 
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might have contributed to the decrease in network clustering over time. For 
this reason, we became interested in deepening our understanding of the soci- 
ological and academic context of the CENS collaboration network to identify 
specific patterns that might account for the observed clustering pattern. For ex- 
ample, can we speculate that the network becoming more sparse is indicative of 
higher interdisciplinary collaboration and/or higher collaboration across differ- 
ent institutions? In this context, the question that we would like to address is: 
what specific mixing patterns are accountable for the decrease in the network's 
clustering coefficient? In the remainder of this article, we extend the temporal 
analysis, presented in the previous section, to a set of node characteristics. Our 
aim is to justify the observed clustering pattern in terms of specific social and 
academic characteristics that are more telling of the sociological aspects of the 
group than what degree assortativity alone can elucidate. 

3.1 Further data collection 

We collected these additional metadata relative to each author in the network 
under study: a) academic affiliation, b) academic department, c) academic po- 
sition, and d) country of origin. 

We collected these metadata via manual techniques, i.e. gathering required in- 
formation on the authors' personal web pages and consulting online directories 
from university and department web sites. It is worth noting that all the param- 
eters collected (except for country of origin) are subject to change over time, 
even in the short timespan studied in this article. Researchers and scientists 
might change institution, department and position in a seven-year time period. 
For this reason, we consulted not only authors' personal web sites, but also 
their curriculum vitae and biographies to record changes in their academic af- 
filiation, department and position. Curriculum vitae were also useful to collect 
authors' country of origin, which, for the purpose of this study, we consider to 
be the country in which individuals pursued their high-school education. Ta- 
ble [3] presents the frequency counts for the collected author metadata: authors' 
academic affiliation, department, position and country of origin. 

A quick analysis of Table [3] reveals that the collaboration network studied here 
consists mostly of scholars from UCLA and USC from the departments of Com- 
puter Science and Electrical Engineering. The network consists of a large num- 
ber of graduate students)^] but researchers and professors at all levels make up 
a large portion of the collaboration network. Finally, the network is very inter- 
national in its population, with about half of the authors being from the United 
States, about a quarter from India and China and the rest from a number of 
other countries worldwide. 

4 Graduate students are students both at Ph.D. and Master level. Although these have 
been incorporated into a single category, Ph.D. students are the vast majority in this group 
(over 95%). 
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Academic affiliation 

University of California, Los Angeles (UCLA) 148 

University of Southern California (USC) 66 

Massachusetts Institute of Technology (MIT) 10 

California Institute of Technology (Caltcch) 8 

University of California, Riverside (UC Riverside) 7 

University of California, Berkeley (UC Berkeley) 7 

University of California, Merced (UC Merced) 4 
State University of New York at Stony Brook (SUNYSB) 3 



Academic department 

Computer Science 113 

Electrical Engineering 80 

Civil Engineering 23 

Biology 19 

Information Sciences 9 

Environmental Science 7 

Education 5 

Marine Biology 4 

Statistics 3 

Linguistics 3 



Academic position 

Graduate Student 97 

Staff / Research Associate (Staff) 67 

Full Professor (Professor) 44 

Postdoctoral Student (PostDoc) 21 

Associate Professor 21 

Assistant Professor 20 

Undergraduate Student 5 

Lecturer 3 



Country of origin 

United States of America (USA) 120 

India 33 

China 24 

Italy 10 

South Korea (Korea) 9 

Australia 5 

Greece 4 

Iran 3 

Taiwan 3 

Mexico 3 



Table 3: Population counts (10 most recurring values) for the collected node 
properties: authors' academic affiliation, department, position and country of 
origin 
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3.2 Analysis of network evolution by discrete assortative 
mixing 



The temporal analysis of degree assortativity, presented in the previous section, 
indicates the extent to which individuals in the network co-author preferentially 
to other individuals with similar degree centrality. Using the newly collected 
author metadata — academic affiliation, department, position and country of 
origin — we can extend our investigation of assortativity to compute mixing 
patterns based on these discrete parameters. In our case, all analyzed param- 
eters are nominal and we can thus measure discrete assortativity coefficient, r, 
using the following formula [29 : 

_ J2j en — J2 i ajbi 

where is the fraction of edges in a network that connect a node of type i to 
one of type j, on is the fraction of edges that have a node of type i on the head 
of the edge, and bi is the fraction of edges that have a node of type i on the tail 
of the edge. Finally, r = — 1 when there is perfect disassortative mixing, r = 
when there is no assortative mixing, and r = 1 when there is perfect assortative 
mixing. In other words, the discrete assortativity coefficient, r, indicates the 
level of homophily of the network for a certain parameter. For example, if r for 
academic affiliation is 1.0, this means that individuals in the network only write 
papers with other individuals with same institutional affiliation. In this kind 
of network, there are no multi-institutional collaborations. On the other side 
of the spectrum, we can imagine a completely disassortative network (r = —1) 
in which every single collaboration (i.e. paper) in the network is authored by 
individuals that belong to different institutions. 

Table [4] presents the discrete assortativity coefficients for the network under 
study based on authors' academic affiliation, department, position and country 
of origin, calculated at seven temporal snapshots of the network (2001 through 
2007). A visual representation of these values is also presented in the plot of 
Figure [2] 



property / year 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


Academic affiliation 


0.438 


0.448 


0.501 


0.533 


0.550 


0.584 


0.544 


Academic department 


0.463 


0.535 


0.574 


0.560 


0.555 


0.516 


0.474 


Academic position 


0.177 


0.192 


0.188 


0.184 


0.182 


0.177 


0.177 


Country of origin 


0.245 


0.286 


0.369 


0.350 


0.347 


0.352 


0.367 



Table 4: Discrete assortativity coefficients for years 2001 through 2007 based 
on authors' academic affiliation, department, position and country of origin 

Looking at Figure [2] it is easy to deduce that these assortativity coefficients are 
all within the same range - between a minimum of 0.1 and a maximum of 0.6. 
Also, they do not change very much over the period under study — fluctuations 
in the 7- year period rarely exceed 0.1. Overall, the network is more assorta- 
tive by academic affiliation and department, and less assortative by academic 
position and country of origin. A detailed temporal analysis of these coeffi- 
cients can provide insights into the extent and nature of mixing patterns in the 
coauthorship network. For example, by analyzing the assortativity by academic 
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years 



Figure 2: Plot of discrete assortativity coefficients for years 2001 through 2007 
based on authors' academic affiliation, department, position and country of 
origin 



department, we can deduce how inter-disciplinary the CENS network is and how 
inter-disciplinarity has changed over time. However, this analysis fails to reveal 
the specific components that contributed to these mixing patterns, e.g. what col- 
laborations are most responsible for the increase in inter-disciplinarity?. In the 
remainder of this section we present, for each one of the studied characteristics, 
a detailed interpretation of our findings. Our aim is to push our understanding 
of the assortativity coefficients further, decomposing the observed collaboration 
patterns along specific components, to allow a more in-depth temporal analysis 
of the observed mixing patterns. 



3.2.1 Academic affiliation 

From Figure [2j the assortativity coefficient based on nodes' academic affilia- 
tion grows steadily over time, by about 0.1, from 0.438 in 2001 to 0.544 in 
2007. This indicates that, overall, authors in the CENS coauthorship network 
have increasingly collaborated with other authors from the same institutional 
affiliation, in the time period under study. In the latest snapshot of the CENS 
network (year 2007) academic affiliation is the single most assortative character- 
istic, suggesting that CENS authors collaborate preferentially with individuals 
in their institution. This finding matches an earlier observation that the com- 
munity structure of CENS collaboration matches very closely its institutional 
configuration [52"] . 

We would like to investigate this finding further, analyzing the specific intra- and 
inter-institutional collaborations that contributed to making the network more 
assortative over time. In order to do this, we inspect the most recurrent mixing 
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patterns by affiliation in the last snapshot of the network, i.e., the institutional 
pairs that make up the majority of collaboration volume in 2007. These values 
are presented in Table [5j Clearly, as the coauthorship network under study is 
undirected, the order of the pairs (affil-1, affil-2) is not relevant. The top four 
rows in Table [5] present the institutional pairs contributing to intra-institutional 
collaboration, whereas the bottom six rows present the pairs contributing to 
inter-institutional collaboration. So for example, in year 2001 there were 42 
coauthorship activities among individuals affiliated with UCLA. From a network 
perspective, this means that in the year 2001, 42 edges, out of a total of 182 (from 
Table [j}, were among nodes with parameter "UCLA" . Collaboration between 
UCLA researchers increases steadily and reaches 1098 (out of a total of 2536) 
edges in 2007. In Figure[3] we plot the values of Table[5]as a fraction of the total 
volume of collaborations each year. Please note that the order of the institution 
pairs in Figure [3] is reversed with respect to Table [5] so that most prominent 
collaborations occupy the lower portion of the plot . 



affll-l 


affil-2 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


UCLA 


UCLA 


42 


112 


262 


468 


678 


948 


1098 


use 


use 


38 


38 


106 


214 


282 


436 


442 


UC Riverside 


UC Riverside 




2 


4 


4 


16 


24 


24 


Caltech 


Caltech 


10 


16 


20 


20 


20 


20 


20 


UCLA 


use 


56 


60 


80 


144 


166 


212 


324 


UCLA 


MIT 


8 


10 


20 


44 


50 


64 


78 


UCLA 


UC Merced 






6 


6 


8 


26 


58 


UCLA 


Caltech 


4 


4 


14 


16 


16 


18 


32 


UCLA 


UC Riverside 


6 


6 


6 


8 


16 


20 


30 


UCLA 


UC Berkeley 


6 


18 


26 


28 


28 


28 


28 



Table 5: Most recurrent academic affiliation pairs (top 10 results, cumulative 
values). Top 4 rows show intra-institutional collaboration. Bottom 6 rows show 
inter-institutional collaboration. 



The stacked plot of Figure [3] allows us to decompose the assortativity coefficient 
trend lines of Figure [2] for discrete parameter academic affiliation. From Figure 
[2] assortativity by affiliation increases steadily from 2001 to 2006 and finally 
drops slightly from 2006 to 2007. This trend can be understood in terms of 
the growth of intra- and inter-institutional collaborations, presented in Figure 
[3] From the stacked plot of Figure [3] we note that in 2001, the vast ma- 
jority of recorded collaborations involves intra- and inter-institutional efforts 
between UCLA and USC individuals. In year 2007, the picture is not very dif- 
ferent, with UCLA and USC still composing the bulk of the total volume of 
collaborations. However, a closer look at the components of the plot reveals 
that intra-institutional collaboration at UCLA has doubled in volume (from 0.2 
to 0.4) while inter-institutional collaboration (UCLA-USC) has halved (0.3 to 
0.15), compared to 2001 values. USC-USC collaboration stays roughly con- 
stant throughout the period under study. The increase in UCLA-UCLA and 
the decrease of UCLA-USC collaborations are the components that are most 
responsible for the increase in assortativity coefficient by affiliation from 2001 
to 2007, presented in Figure [2] 

There are some other collaboration dynamics that contribute to this trend. For 
example, besides UCLA-USC, the overall inter-institutional effort of UCLA de- 
creases (e.g. collaborations with UC Berkeley and UC Riverside). Moreover, 
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UCLA - UC Berkeley 
UCLA - UC Riverside 
UCLA - Caltech 
UCLA - UC Merced 
UCLA - MIT 

UCLA - USC 

■ Caltech - Caltech 

■ UC Riverside - UC Riverside 

■ USC -USC 

■ UCLA -UCLA 




2001 2003 2005 2007 

2002 2004 2006 

Figure 3: Top ten most recurrent academic affiliation pairs as fraction of total 
volume of collaboration. Darker polygons at the bottom are intra-institutional 
collaboration, while lighter polygons depict inter-institutional collaboration. 

intra-institutional collaborations by Caltech researchers (which make almost 
10% of the total volume in 2001) fade away over time. In sum, by year 2007, the 
collaboration scenario at CENS is largely dominated by publications authored 
within UCLA. Based on this finding, we can conclude that despite CENS's 
mission to be a multi-institutional research center, the temporal decomposition 
of coauthorship patterns demonstrates that CENS collaboration became less 
inter-institutional from 2001 to 2007 consolidating around its main institution, 
UCLA. The steady increase in UCLA-UCLA collaboration can possibly be at- 
tributed to the construction of a CENS headquarter office at UCLA, completed 
in 2005. We can speculate that the CENS headquarter has brought UCLA 
scholars closer to each other, enabling interpersonal collaboration among them 
and, in turn, boosting coauthorship activity^] 

3.2.2 Academic department 

From Figure |2j the assortativity coefficient for academic department has the 
following trend. In year 2001, the CENS network is heavily assortative based 
on academic department (r — 0.463). In the following two years, assortativ- 
ity increases even more, reaching a peak of 0.574 in 2003. This means that 
in 2003, the CENS coauthorship was very highly fragmented by department. 

5 This is a fair assumption especially for the authoring of scientific conference papers, that 
have a much quicker publication turnaround than journal articles. 



15 



By extension, we can speculate that at this time, collaboration patterns were 
vastly mono-disciplinary. In later years, however, assortativity by department 
decreases. Even though the value recorded in 2007 (r = 0.474) is roughly 
equivalent to the network's outset, the trend observed from 2003 to 2007 in- 
dicates the CENS collaboration network becoming more interdisciplinary. A 
decomposition of the observed coauthorship patterns, presented below, enables 
us to understand, more in depth, the extent of intra- and inter-departmental 
collaboration. We present in Table [6] the most recurrent mixing pattern pairs 
by academic department. The stacked plot of Figure [3] depicts these values as 
fraction of total volume of collaboration. 



dept-l 


dept-2 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


Computer Sci 


Computer Sci 


84 


154 


288 


488 


568 


712 


752 


Electrical Eng 


Electrical Eng 


16 


44 


112 


196 


292 


338 


358 


Civil Eng 


Civil Eng 






2 


46 


48 


74 


71 


Biology 


Biology 




10 


24 


28 


48 


62 


66 


Computer Sci 


Electrical Eng 


8 


40 


88 


196 


274 


346 


406 


Computer Sci 


Biology 


18 


18 


18 


20 


20 


60 


80 


Computer Sci 


Civil Eng 








42 


48 


68 


68 


Computer Sci 


Information Sci 


38 


38 


46 


48 


48 


48 


56 


Electrical Eng 


Biology 












10 


38 


Electrical Eng 


Statistics 








10 


22 


28 


36 



Table 6: Most recurrent academic department pairs (top 10 results, cumulative 
values). Top 4 rows show intra-departmental collaboration. Bottom 6 rows 
show inter-departmental collaboration. 



In Figure |4j the darker, bottom four polygons are the department pairs that con- 
tribute the most to intra-departmental collaboration, whereas the top six poly- 
gons present the department pairs that account for most of inter-departmental 
collaboration. At the network's outset, CENS collaboration is dominated by 
intra-departmental collaborations in Computer Science and Electrical Engineer- 
ing. The most prominent inter-departmental collaborations are between Com- 
puter Science and both Biology and Information Sciences. The increase in the 
assortativity coefficient by department from 2001 to 2003 (shown in Figure [2]) 
can be attributed to a number of factors, including (i) a slight increase in collab- 
orations among Electrical Engineers, (ii) the appearance of novel collaborations 
among Biologists, and (iii) a substantial drop in collaborations by Computer 
Scientists with both Biologists and Information Scientists. 

In the long run, however, the intra-departmental volume of collaboration among 
Computer Scientists decreases steadily over time. This decrease, coupled with 
the growth of a number of inter-departmental collaborations (Computer Sci- 
ence with Electrical and Civil Engineering, as well as Electrical Engineering 
with Biology and Statistics) , is most responsible for the assortativity coefficient 
trend presented in Figure [2j i.e., the CENS coauthorship network becomes less 
assortative by department, and thus more inter-disciplinary, over time. 

The observed patterns can be interpreted as follows. First the overall pres- 
ence of intra-departmental collaborations in Computer Science is telling of the 
nature of research being performed at CENS. The domain of networked sens- 
ing emerges historically from computer network research and is thus, normally 
located as a branch in departments of Computer Science. Sensor network tech- 
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Electrical Engineering - Statistics 
Electrical Engineering - Biology 
Computer Science - Information Sciences 
Computer Science - Civil Engineering 
Computer Science - Biology 
-■ Computer Science - Electrical Engineering 

■ Biology - Biology 

■ Civil Engineering - Civil Engineering 

■ Electrical Engineering - Electrical Engineering 

■ Computer Science - Computer Science 




2001 2003 2005 2007 

2002 2004 2006 

Figure 4: Top ten most recurrent academic department pairs as fraction of total 
volume of collaboration. Darker polygons at the bottom are intra-departmental 
collaboration, while lighter polygons depict inter-departmental collaboration. 

nologies, however, require the design and construction of wireless sensors, and, 
in turn, interaction between computer sciences and engineering disciplines fol- 
lows necessarily. This growing incidence of a core set of Electrical Engineering 
collaboration (both intra- and inter-departmental) is evident in Figure [4j It is 
interesting to note that inter-departmental collaborations with Electrical Engi- 
neers involve not only Computer Science, but also Biology (a major scientific 
application area for sensor networks) and Statistics (a discipline increasingly 
required by field scientists to deal with issues related to sensor data cleaning, 
analysis, and modeling). Finally, it is worth noting that the volume of intra- 
and inter-departmental collaborations involving the department of Civil Engi- 
neering increases over time, possibly reflecting the inception in 2004 of a new 
application area at CENS involved in the development and application of sensing 
technologies in urban and social settings. 

In sum, CENS, a research center emerged as a sub-domain of Computer Sci- 
ence, has progressively become more inter-disciplinary over time. The increase 
in inter-disciplinarity can be attributed to CENS' need to develop sensor net- 
work technologies (Electrical Engineering), apply and deploy them in field en- 
vironments (Biology and Civil Engineering), and concurrently deal with data 
analysis issues (Statistics). 
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3.2.3 Academic position 

From Figure [2] it is evident that the discrete assortativity coefficient based on 
academic position has very little influence on the overall topology of the CENS 
coauthorship network, compared to the other computed measures. Assortativ- 
ity by academic position never exceeds a value 0.2 and, very importantly, it 
remains practically unchanged throughout the period under study. This finding 
suggests that the network is very weakly assortative with respect to academic 
position, i.e. coauthorship activities involve the collaboration among scholars of 
all ranks. It is fair to speculate that this finding is a de facto characteristic of 
scholarly publishing in many academic fields. However, a detailed decomposi- 
tion of the most prominent academic position pairs can help reveal the specific 
mixing patterns that influence this assortativity value of the CENS coauthor- 
ship network. Table [7] presents the top ten most prominent academic position 
pairs and the stacked plot of Figure [5] presents these values as fraction of the 
cumulative, yearly volume of collaboration in the CENS network. 



position-l 


position-2 


2001 


2002 


2003 


2004 


2005 


2006 


2007 


Graduate 


Graduate 


10 


16 


52 


108 


134 


172 


184 


Professor 


Professor 


20 


42 


64 


86 


100 


120 


142 


Professor 


Graduate 


34 


64 


124 


218 


262 


338 


374 


Staff 


Graduate 


11 


30 


66 


116 


162 


224 


260 


Professor 


Staff 


20 


36 


76 


116 


148 


194 


242 


Associate Prof 


Graduate 


8 


8 


18 


72 


82 


120 


132 


Associate Prof 


Professor 


11 


26 


34 


76 


86 


106 


130 


Associate Prof 


Staff 


2 


4 


22 


48 


58 


84 


108 


Assistant Prof 


Graduate 


6 


10 


28 


56 


68 


96 


102 


PostDoc 


Graduate 


18 


26 


18 


66 


76 


90 


100 



Table 7: Most recurrent academic position pairs (top 10 results, cumulative 
values). Top 2 rows show collaboration between individuals with same position. 
Bottom 8 rows show collaboration between individuals in different positions. 

In Figure [5] the two darker polygons at the bottom depict the position pairs 
that contribute the most to make the network more assortative — collaborations 
between both graduate students and between full professors. The eight polygons 
in lighter shades of gray depict the volume of collaborations that make the 
network less assortative, for they represent coauthorship activity among scholars 
of different ranks. 

The plot of Figure[5]is very variegated and none of the analyzed pairs stands out 
for volume of collaboration. Moreover, the overall assortativity coefficient by 
academic position remains constant over time. Yet, certain academic position 
pairs display some minor fluctuations which we deem interesting. Looking at 
the darker polygons in Figure [5] we note that collaboration among full professors 
diminishes slightly from 2001 to 2007, while coauthorship between graduate stu- 
dents increases, in the same period. This result suggests the following research 
scenario: at its outset, CENS did not include many graduate students; most of 
the research to "bootstrap" the research center was carried out by faculty mem- 
bers. However, as the center grew in size, more and more graduate students 
became involved and patterns of collaboration among graduates became more 
prominent. 
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Grad Student - PostDoc 
Grad Student - Assistant Prof 
Assoc Prof - Staff 
Assoc Prof - Professor 
Assoc Prof - Grad Student 
Professor - Staff 
Grad Student - Staff 
Professor - Grad Student 
Professor - Professor 
Grad Student - Grad Student 




Figure 5: Top ten most recurrent academic position pairs as fraction of total 
volume of collaboration. Darker polygons at the bottom depict collaborations 
among pairs of individuals with the same academic position, while lighter poly- 
gons depict collaborations between individuals of different academic positions. 

This finding is also reflected in the analysis of collaborating pairs of different 
ranks, i.e., the lighter polygons in the top portion of Figure [5] Again, no major 
fluctuations emerge from a quick visual analysis of the plot. However, if we 
look collectively at collaborations of full professors with graduate students, staff 
members and other professor ranks, we note that they decrease slightly from 
2001 to 2007, while collaborations among graduate students, staff, associate and 
assistant professors increase during the same time period. This finding reinforces 
the research scenario presented above; it indicates that after a "bootstrap" phase 
in which scholarly publication was led by high-ranking professors, a younger 
research network was formed, consisting of graduate students and faculty in the 
early stage of their careers. 



3.2.4 Country of origin 

The final discrete assortativity coefficient we analyze in this article is the country 
of origin of the individuals in the network. Figure [2] shows that this coefficient 
increases steadily in the first two years and then levels off in later years at a value 
around r — 0.35. What specific intra- and inter-national collaborations may 
account for such trend? Even if the vast majority of publications analyzed in this 
study are based on research performed in the United States, it is interesting to 
explore the tendency of individuals to collaborate with others from their country 
of origin, even when they are working and living abroad, or in different countries. 
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We present in Table [8] the ten most prominent countries that contribute to 
mixing patterns by country of origin. Figure [6] depicts these values as a stacked 
plot and as fraction of yearly cumulative volume of collaboration. 



origin- 1 


origin-2 
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2004 
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USA 


USA 


36 


90 


150 


268 


332 


516 
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India 


India 
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6 


40 


72 


92 


104 


110 


China 


China 


6 


6 


36 


38 


38 


38 


38 


USA 


India 


34 


44 


68 
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298 
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USA 


China 
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12 


54 


84 


86 


94 


94 


China 


India 


12 


14 


22 


42 


44 


46 


46 


USA 


Italy 




12 


16 


28 


28 


31 


46 


USA 


Korea 




2 


2 


6 


18 


36 


36 


USA 


Taiwan 


2 


2 


2 


6 


16 


18 


36 


USA 


Iran 






4 


20 


24 


26 


26 



Table 8: Most recurrent country pairs (top 10 results, cumulative values). Top 
3 rows show intra-national collaboration. Bottom 7 rows show inter-national 
collaboration. 




Figure 6: Top ten most recurrent country of origin pairs as fraction of total 
volume of collaboration. Darker polygons at the bottom depict intra-national 
collaboration, while lighter polygons depict inter-national collaboration. 

At the network's outset, the vast majority of collaborations is among Americans 
and between Americans and Indian and Chinese researchers. By year 2003, the 
picture only changes slightly. More intra-national collaborations appear (China- 
China and India-India), while inter-national collaborations between USA, India 
and China drop. These dynamics account for the growth of overall assortativity 
by country of origin recorded from 2001 to 2003, and visible in Figure [2] By 
year 2007, the picture becomes more variegated. Intra-national collaborations 
among American researchers still dominate. However, a number of novel inter- 
national collaborations emerge, namely between USA and Italy, South Korea, 
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and Iran. Connecting this finding with the observation made in the previous 
section that collaboration at CENS was initially "bootstrapped" by faculty 
and later continued by graduate students and younger faculty — it is possible 
to infer that as soon as CENS acquired a solid research core of collaboration, 
by year 2004, it began to attract and involve collaborations by international 
graduate students, researchers and faculty. 

4 Conclusion 

A great deal of research on scientific collaboration is performed on large-scale 
networks, constructed from bibliographic data harvested from domain-based 
and institutional document repositories. While these analyses rely on great 
quantities of data to study the structure, evolution and similar macroscopic 
features of scientific collaboration patterns, they often ignore certain contextual 
and microscopic factors, such as the social and academic arrangements in which 
collaboration takes place. This is because many available bibliographic datasets 
contain detailed publication metadata, but very little or no data about the 
authors writing those publications. 

In this article, we perform a longitudinal analysis of the range, configuration and 
topology of a small network of scientific collaboration over a 7-year period. The 
network presented here is constructed from the bibliographic record of CENS, a 
research center involved in the development and application of sensor network 
technologies. Given the relatively small size of the network (N = 291, in its 
largest year), we were able to manually collect additional metadata for every 
individual in the network studied. We used these node characteristics to explore 
the assortative mixing based on academic department, affiliation, position, and 
country of origin. 

Our findings reveal that, in the period under study, the CENS collaboration 
network: a) becomes more assortative in terms of academic affiliation, i.e. more 
intra-institutional, b) becomes less assortative in terms of academic department, 
i.e. more inter-disciplinary, c) is not assortative in terms of academic position, 
i.e. collaboration patterns are not dependent on researchers' academic positions, 
and d) is only slightly assortative in terms of country of origin, i.e. the extent 
of inter-national collaboration decreases slightly over time. 

We interpreted our findings in terms of the specific components that constitute 
these mixing patterns, finding that a) the increase in intra-institutional collabo- 
ration is possibly caused by CENS research consolidating around its headquar- 
ter base at UCLA, completed in 2004; b) the increase in inter-disciplinarity is 
largely due to the shift a CENS' research agenda, to incorporate new domains, 
such as civil engineering and urban planning, besides the domains traditionally 
associated with sensor network research, i.e., computer science and electrical 
engineering; c) CENS research was initially dominated by collaborations be- 
tween senior researchers, but as the center matured, it became more diversified 
and a core research component consisting of collaborating graduate students 
and young faculty emerged; and d) the volume of international collaboration 
between USA, India, and China decreased but new smaller international efforts 
began as the organization matured. 
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This qualitative explanation of our findings revealed specific small-scale patterns 
that a quantitative analysis of assortativity alone would have failed to uncover. 
We speculate that supporting social network analyses with the proposed qualita- 
tive investigation of mixing patterns can provide a deeper understanding of the 
dynamics that shape (and are in turn shaped) by the changing socio-academic 
landscape in which scientific collaboration takes place. 
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